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ABSTRACT 

This paper reviews 20 cross-cultural studies 
conducted with the. Defining Issues Test (DIT) . All or part ,of the 
samples in the studies were norr- Americans who had not been a part of 
,th2 populations presented in tfle DIT manual. The studies were aimfrd 
at the comparison of moral reasoning structure and development .Across 
cultures. Several aspects ot the findings were reviewed:, (1) the 
psychometric properties of. cultural versions of the$DIT; (2) effects 
of the examinee's ethnic, background; (3) age and education trends; 
(4) sex differences; (5) correlations with other psychological tests; 
(6) religious differences; (7) urban-rural milieu; (8) delinquent 
behavior; and (9) familial and social factors. The studies reviewed 
in this report satisfy the evaluation criterion of the 
generalizability of psychological theories developed in one culture 
to another culture. The DIT seems to have cross-cultural validity in 
detecting moral reasoning structure and its development in cultures 
outside the United States. The validation and explanation roles of 
the DIT in cross-cultural studies should be pursUed together. 
(DWH) 
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A Review of Cross Cultural Studies On Moral Judgment- 
Development Using The Defining Issues Test 




I. Introduction 



Hundreds of studies .have been conducted using Rest's 
Defining Issues Test < Moon , • 1984 ) . Among the, studi es , ; 
crqss-cultural studies are frequently found. In this p&p£r, 20 
croas-cul tural DIT studies which were available to .the author 
will be reviewed. They were regarded as "cross-cultural" in * 
that all or part, of the samples ^in the studies were non-American 
who. had not been a part of the populations presented in the DIT 
manual < Rest, 1979b) . * § 

i X 

The samples of the 20 studies Represent 15 cultures o^ 
countries: AuStraliaO) , BraziKl), GreeceCi), Hong Kong<2), 
Iceland(I), IndiaU), IsraeKl), Japan<2), KoreaU), Hexico(l), . 
PkilippinesCl) , Saudi ArabiaQ), South AfricaU), Taiwan<2L, and 
Trinidad and Tobago<l). All the studies but one were cross- 
sectional and M etic" in the sense that their aims were primarily 
at the comparison of moral reasoning structure and development 
across cultures. 

t 

0 Seven of the 19 etic studies ^mpBdyed two or more 
culturally different samples and attempted to test the 
hypothesis related to cultural background . The other 12 studies 
employed samples of the same cultural background and attempted 
to .compare the results to other already established results such* 
as shown in the DIT manual (Rest, 1979b / pp . 7 . r*- 7.18), for 
example. Their objectives vary from validation studies of the 
TIT to correlation and intervention studies, A variety of 
tactors were related to the DI r . scores: for example, age, 
education, sex, religion, region, s,cfbres from various 
psychological tests, familial/societal factors, and so on. This 
review will prQceed to discuss the cultural validation issues 
first and the factors later. Table 1 below presents a summary 
overview of the 20 studies. • 



Insert Table 1 here 



II. Review of the findings 



A. Psychometric properties of cultural versions of the DIT 

The equivalence of test stimuli is a basic requirement for 
ri comparison to be made and is the most vexing problem in 
cross cultural research < Predericsen , 1977). In DIT studies. 
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comparisons of the DIT* scores across cultures make sense only if 

cultural versions of the original DIT measure the same thing 

that the original DIT has measured in the American studies. Such 

equivalence could be addressed in. terms of psychometric 

properties, translation, and cultural adaptati on. , 

% f 

One of the important ways of establishing the equivalence 
is tg see if cultural versidns of ■ the .original DIT have«the same 
psychometric properties (e.g., reliability, validity / i tem 
structures , etc. ) and the same relationship with other 1 
convergent/divergent variables thctt the original DIT has ehown 
so f^r. First of all, the psychometric properties of the 
culturally adapted and/or translated versions of the DIT will\be 
ekamined before looking into other^resul ts of , the tests. 

Th& original DIT has quite high test-retest reliability * 
(.82 and .77 s for 6~ and 3-story form respectively) and internal 
consistency (.77 and .76 for frr and 3-stCrry form respectively . in 
terms of Cronbach's alpha).. The 3-story form tends tc show lower 
levels of reliability , and internal consistency. Four out of 19 
studies report reliability arxd/or internal consistency measures. 
By and large, test-retest stability and internal consistency of 
the translated t)ITs are somewhat low except the Australian DIT 
fDirkinson, 1979, r= . 98- .99) which is almost identical to the 
original DIT except for; a few changes in several words. In 
Korea^ Park^ J and Johnson, R,s study report a quite comparable 
(but a little lower) level of reliabili ty ( r= . 69) . I n the studies 
of Bzuneck (1978 , f- . 39) and Hau ( 1983, r= . 32) , the reliabilities 
are quite low. Therefore these Measures may not be successful 
adaptations of the original DIT. Only two studies report 
internal consi stency" measures which are somewhat lower 
bouruHDickirison, 1979, alpha=.66; Hau, 1983, alpha=.50) 
c 

The original DIT has shown consistently significant 
cor relations. with MorAl Maturity Score(fttflS) of Kohlberg's test 
at .43 - % .70 Ife^eiyand with the Law and Order Test at -.60 - 
-.58 level. Considering these measures are supposed to relate by- 
definition to the construct of P-score of the D T T, similar 
relationship can be expected in the translatedetic DITs If they 
are really equivalent to the original DIT. The P-score of the 

# (Thorl indsson , 1978) has negative correlation with 
Test at -.4498 level which compares to - . E>8 in the 
es (Rest, 1979b). Two Chinese studies (Tsaing, 
1980, r=. 486; ita , 1980r=.20 - .29) show that P-score of their 
Chinese DITs nave moderate correlations with a Kohlberg's 
measure. In *!a's study (1980), the correlation was mufch lower 
than those of the American and Tsaing's sample. But the high 
homogeni-ety of Ma's sample may be a cause. 

* In Hau's study (1983), the psychometric property of his 
Chinese DIT is extensively explored. It, therefore, is worth 
mentioning more in detail. First, he replicated HcGeorge's 
fakability experiment ( 197b) . The result confirmed that his 
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subjects were unable to fake good. Second , .he replicated some of 
Davison's analysis of the. DIT's stage* and internal structure 
< Davi son>* 1979) , using a' correlational matrix of stage scores 
and a factor analytic % raethod . He found asimplex ^pattern of 
intercorrelations . and a two factor struqture 9s demonst rated in 
Davison'study <even though'there was a minor reversal of stage - 
ordering). Third, he ' conducted ( an item analysis. A Good item 
should have the highest coiT^l^tion with its own stage score 
and^T^w correlations with other stage scores. His "results show 
that in 61 out of 72 itemfe^ the , correlations with their own 
stage scores were the highest. For the • remaining Jl wrongly 
keyed items, nine items did not have significant correlations 
with any of the 6 statjes, thud precluding them f roc* being 
reassigned to other stages"( p\ 70) . 

He al^e checked the Alpha level when £ach.item is deleted 
from its subscale.The result showed that all but 11 items have 
the highest correlation with its own subscale and 66 out of 72. 
items show decreased Alpha when they are deleted. It shows, by 
and large, that his DIT has comparable psychometric properties 
to that of the original DIT<p.70) 

Translation and cultural adaptation are also important 
factors affecting the equivalence of the . t ranslated and the 
original .DIT. The insensi tivi ty> to translation error is well • 
reflected in the studies, because information about translation 
is rarely reported and, when reported, ver^ superficially. Only 
three studies describe their translation method or procedure ' 
briefly: multiple translation (Thorl indsson , 1978), team and 
back translation (Benor etr al , 1978), and the bilingual raethod\ % 
(Hau,1983). The first two studies did not attempt to identify ) 
translation enrbr systematically. They just "tried their best". 
But in Hau's study (1983), .translation error or its effect was 
detected through the experimental method as suggested by Prince 
& Horabour ^1967) : "subjects were randomly divided into two 
groups. The first group took the first half of The DIT in ' m 
English and the second- half in Chinese ; the order was reversed 
for the second group. After two weeks, they were retested, using 
the tefit of the other group <p.34X" The result shows no 
significant difference in all scores <2, 3, 4,, %ar, 5b, 6, f and 
D-score) but- the P-score. It was interpreted that to the 
Chinese student, the P-score in the English version u£e an 
underestimation whefeas the scores in the Chinese version are 
the* subject's actual level of moral judgment. Therefore, it was 
contended that the difference in P-score is at least in part 
due to subjects deficiency in English. 

In the course of translation, * M~iteras (check items for faking 
good) were foun.d iifficult- to translate. For example, 
Thorl Insson reports that the M-iteras (especially tt5 item in 
"Newspaper" and «6 item in "Prisoner" dilemma story ) were 
perceived by subjects as a Stage 4 or a Principled stag£ ite». 
Such difficulties were also reported in Gendron's study, ri iteras 
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have another problem besides translation. According- to Gendron, 
if* he dropped out subjects who had M-score of 4 < the cutoff - 
point in the 3^story form ), he would have lost 37.2% of 
total subjects { '-usually 5-15% in American sample), A slightly 
higher than* expected dropout rate was alqo reported in Benor et 
al's study (1982, 17. 1%) and' Hau's study <1983, 16%).! The 
details of attempts at. making cultural adaptation are rarey 
reported ^ In Dickinson's study <1979) in Australia, several 
American uiords were changed into Australian equivalents. In 
flenor et al^ study (1982) , the name, nationality* occupation of 
the actors in the story were changed to fit the Israeli, petting? 
The effects of such changes were not systematically examined. 

B. Effects of ethnic background 

According to Kohlberg( 1971 ) , "all individuals in alx 
cuj.tur.es go through the same order or sequence of gross stages 
of^moral development; though varying in rate and terminal point 
of development" (p. 175). In the DIT studies, the effects of 
cultural background have been investigated in ^several ways.Ohe 
type of study selects two or more samples having different 
subcultural backgrounds and treat bhem as" representing levels of 
acultural dimension as an independent variable — for instance, 
Anglo-and Greek Australian (Hatson , 1983) and American and 
Japanese- American (Jacobson*, 1977). Another typfe Of study 
compares tl)e scores of subjects from one culture to those of 
another culture( usually , the USA) ^rr for instance, Hau(1982) 
and Park & Jo*yison( 1983) . . Seven of the 20 studies belong to the 
former category like Watson and Jacobson's study. 



Insert Table 2 here 



As shown in Table 2, the studies treat cultural background 
as an independent variable while holding the age and education 
level constant. Considering that the effect- of age~educat ^on 
factor on moral judgment development ( P-score ) approximates 38% 
of the total variance (Rest, 1979a, p. 110), the effect of 
cultural backgrodnd could be well delineated if they succeed in 
holding them constant. This control is not clear-in . 
Jacobson's( 1977) mother subjects and Deyoung' s( 1980) teacHer 
samples. Jn Ma's study (1980), subjects are not controlled in 
age. 

r 

In the seven studies, ten comparisons were made between 
ethnic groups: 6 comparisons of Western versus non-Western 
culture, 4 of Westerh versus Western cultures. Five out of six 
Western \iersus non-Western comparisons showed significant 
difference in P-score, while none of Western versus Western 
comparisons showed significant differences. In general, the DIT 
P^seore is more likely to diverge between Western and 
non Jdestern samples than between Western and Western samples. In 
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most of the comparisons, the closer the sample to American 
culture, the higher the P-score tends to be. That is, American 
subjects scored higner in P-scores than their respective 
counterparts in Mexico, Japan, and Saudi- Arabia. English 
adolescents did better, than Greek agemates. Anglo-Australians 
scored higher than miscellaneous Australian groups, 
Greek-Aust ralians , . and Asian-Chinese ( See Table 2 for the 
explanation of the four ethnic groups). 



C. ,Age/education trends 

Nothing is more crucial to a cognitive developmental 
construct than evidence of change over time from less advanced 
forms of thinking to more advanced forms <pf thinking. Age- trends 
in moral development have been used by Piasget and Kohlberg As a 
tnajor justification of their claims. /The cross sectional trends 
have been well* shown in many DIf studies as well, as is shown 
in Figure 1 for the American sample in the DIT manual (Rest, 

1979b). - ; 

I 



Insert Figure 1 here. * 



In the case of the American sample, one-way ANOVA of the 
four student^ groups showed an extraordinary strong developmental 
trends (p<.0001> and age-educati on. was found to accdunt for 38* 
of the total variance of moral judgment. The American's average 
DIT score tends to increase about 10 points with each increase 
in level of education (Rest, 1979, p. 110). Comparing the 
P-scores of American samples to those of other cultures, it is 
interesting that the American samples are ndt always the most 
advanced as* shown in Figure 1. / 

All six studies of non-American samples show clear 
developmental trends in P-score. This means that older and 
better educated subjects are likely to attribute mote importance 
to the higher stage issue statements (items). In Hau's 
study (1983), age/education accounts 12% and 23% (Omega-square ) 
of the varianfce of P and D scores respectively, suggesting 
somewhat lesser effect of age-education on moral reasoning 
development i,han in the American study. Even though statistical 
test were not performed, the general trends shown in Figure 1 
appear that the P-score increase with age-education, and that 
the rate cf development seems different \between Western 
(Americans, Icelandics, and Australians)* and Eastern sample 
(Koreans and Chinese). It^seems that the older and better 
educated the Eastern subjects, the more they lag behind their 
American agemates. As shown in Figure 1, in junior high school 
years the Eastern subjects' P -scores are either higher or quite 
close to that of American counterparts. But as they get older 
and more educated, the rate of increase in P score appears to 
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slou dfewn.This seems to be not the case with Icelandic and 
Australian samples. Such slow-down pattern was also indicated by 
Lei(1981) in his research on Chinese sample. However, this may 
be Vn averinterpretation of the data, because it. is not certain 
that the difierente of the rate is significant or not without 
any test cf significance between jHestern and rion-Uestern. 

In thf developmental trends, age and education are 
confounded each other especially in student samples, since the 
more educationally advanced subjects are invariably older. In 
American samples, education was found to before related than 
chronological ageto^the increase in J^score. Hau's<1983) and 
Uatson s<19830 studies directly compare the effects of age and 
education on the DIT scores. In Hau's study r when the ndueation 
was controlled, the correlation of' DIT scores<P and D jcore) 
with age were very Low ranging from -.17 to .22 (p <• 001). This 
means. most of the contribution of age-education to the DIT 

P-scdres(12% in P-score ,23* in D score) -are due to education 
and not age <p.43). 

In Watson's study <1983) al so , educational attainment was 
more strongly related to moral judgment level than age. He 
found a significant interaction effect between age and 
education suggesting that the picture is not as clear as that 
proposed by Rest <1979a). There, Rest suggests that adults in 
general do not show much advance in moral reasoning beyond tha v 
accounted for by thei'r level of education Ip. 112). Uatson< 196J) 
inteKprets the interaction as suggesting t*^t. a P-score in the 
low 30sw*epresents the minimal level achi evettsjyp adai ts with at 
leaB-fe^some higji-school education. Further advances in .the level 
of moral judgment beyond this point appears to b6 related to the 
experience pf higher education" (p. 60). 

Some variables related to education have been explored to 
see if they have any consistent relationship to the DIT scores. 
' Two studies address the relation of school type to the DIT 
scores (Beddoe, 1980; Deyoung, 1982). Benor, Notzer, Sheehan, £ 
Norman<1982) investigated the relationship of interview score 
gathe r ed from a medical school entrance examination with the DIT 
scores. Clarke(1978) studied the relative potency of teacher- 
related variables as a%predictors of ttife development of the 
children . Career interest and school speciality were also 
related to the DIT scores by Miller (1980) and Prahallada 
(1983). In those studies, DIT scores were found to relate 
\ significantly or at least nrarginally to the factors such as 
school type, medical school examination, teacher potency, and 
career interest of students. 

D. Sex difference 

The charge of sex bias in Kohlberg's test has drauwr much 
attention to the sex variable. While some researchers have 
c^d^raed that Kohlberg's model is biased against women because a 
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morality of /justice is not attuned to the thinking patterns gf 
women (Gilligan, 1977), other researchers have argued that there 
is little evidence for.the alleged sex-bia? Claim , because 
there are not significant and consistent sex difference found in 
Kohlberg's test % (Ualker , 1985) and in the DIT (Rest, 1979). 
Recently Thoma (in press) conducted a raeta-ajialysis on the ] 
acciimmulated DIT data of 56 samples comprising 6000 mal£ and > 
female Subjects.. He found that #f overall, and" at every * 
age/education level, females score significantly higher than 
males. But t*hc magnitude of the difference is sroall ft (p. 1) . 

Seven studies directly address this isSue. Most of the 
Chinese- and Korean subjects except Ma's study (1980) show sex 
difference in P-score and Stage 4 score with femalett. scoring 
higher Pscbres, and lower Stage 4 than male subjects. In 
Tsaing's study (1980), a sex difference is shown consistently 
through age 13 to 17 except at age 16. Two studies with 
Uescern samples (Jacoison, 1980; Watson, 1983) found no sex 
difference. i f 

It is interesting that Stage 4 score is higher -in male 
subjects in HaCKs Chinese and Park & Johnson's Korean 
samples,. One more interesting fact is that the sensitivity of 
P-score and D-score to sex difference is mixed . In Hau's study 
(1983), the P-score has more discriminating power over the 
D-score in detecting a sex difference. In Ma^s study (1980), 
however, the Kscore was found less sensitive than the J-score 
in his English sample( n=108) , but later uhen the test was 
refined with more sub jects(N=272) , the result u,as the oppositeCp 
<.10O). It was not explained whether this was due to just a 
measurement error or due to a true difference. 

In Pox's study, sex differences were shown in the M-score 
in both Greek and English subjects with male subjects having* 
almost double the score of females* Other studied do not have 
information on sex differences in M-score. Even" though there are 
significant sex difference in many studies, the actual magnitude 
of the difference is not large. By and large, the result of the 
seven studies seem to oe consistent with the conclusion of Thoma 
(in press ) . 



E. Correlations with other psychological tests 

The DIT has been found to have a consistent relationship 
with some psychological variables as predicted from theory. For 
example, the P-score is hypothesized to have a moderate 
correlation with IQ measures, while it is hypothesized to have a 
negative or inconsistent cor relation .with dogmatism and 
P scale. With American samples (Rest, 1979, pp. 198-201) , the DIT 
was found to correlate in the .20 to .50 range with measures of 
IQ, aptitude, and achievemement and to have moderate correlation 
with some scales of the California Personality Inventory (e.g.. 
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^Achievement via independence ,. 48; I nt^lectual efficiency, .42) 
and Omnibus Personality Inventory (e.g., Autonomy, 
. 47;CorapleXity , .45). 

Four of 20 studies carry some information on the 
relationship of DIT scores to other psychological tests. The 
trends are quite similar to those of American studies. The 
Chinese sample of Haii's study shows very similar correlation 
•between the P and IQ score< Raven" s 'Progressive Matrix, hence 
RPI1) to that ol the American sample. In Prahallada's study 
(1982) in India,, the P-score has no positive correlation with 
IQ neasures'of FjtPW at a significant level ( p< . 05) . Two cognitive 
measure*? of Thorlindsson^s Icelandic sample( ElaDorate Language 
Test and Role talcing ability) show somewhat low but significant 
correlation with P-score. In the case of personality 
variable's, Ha'sKtwo of California Personality Inventory 
subscales) and Prahallada's studies( Bell * s Personality 
Adjustment Inventory) sfcow similar but a little lower 
correlation between DIT scores and personality scores than 
reported for American samples. 

P. Religious affiliation. 

# * 

No significant difference hasfbeen found in the moral 
reasoning development among Protestants, Catholics, Jews, 
Buddhists, HosLems, and atheists in Kohlbergian studies 
(Kohlberg, 1971, p. 174; Kohlberg* and Kramer, 1976). The studies 
using the DIT aLso report no ^jgnificant difference in DIT 
P-score among religious r.f filiation (Getz,1983). But as shown 
in Lawrence's study(197f; Ernsberger, 1976), idiological ( or 
religious) commitments can ove'rride conceptual adequacy in 
making moral judgments. In this sense, the issue of religion to 
moral judgment development is not settled. 

Tl^ree studies investigated the relationship of religious 
affiliation and the DIT scores. In two studies (Watson, 
1983;Beddoe, 1980), religious attiliation made no significant 
difference in P-scores. This is consistent with other studies in 
Getz's review (1983). In Gendron's study (1981), direct 
comparison between the non-Christian and the Catholic group is 
not warranted because they are different in age-education 
level. Of interest is the seminarian and nun group ( Group 4 in 
Table 1), who are undergraduate Catholic students from the 
department of thelogy and religious studies. Their DIT P-scores 
are lower than those of other Catholic students (Group 3), the 
Catholic young professionals (group 5), and are not V *V 
significantly higher than the senior high students (Group 1 and 
2). Cohort effects may be a possible explanation because the 
cjroup covers a wide age range (23-63 years old). As Gendron 
suggests, another plausible explanation may be sought from the 
background and personalities of this sample: "most of the older 
subjects in the group are nuns, who were brought in a 
conservative environment, where Catholic moral teaching had a 
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clear and unquestioned answer for any moral problem. Strict 
adherence to official church teaching was required <p.3) ff . If 
the inference is correct, the religious factor should be 
regarded as' having profound etfects on moral judgment 
development. His study has an implication that the 
non-significant result of the previous studies on religious 
affiliation and moral development may be due to its superficial 
or misguided categorization of the religious variable. 

G. Urban-rural rai'lieu 

As Rest<1983) suggests, "the lack of paticipation in 
secondary i nsti tution- < e . g . , the national legal system and 
government , bureaucracies of industry) me Ices it less likely that 
those people will evolve the conceptual schemes of stage 4 , 5, 
and 6 that' deal especially with moral problems at this more 
abstract level" (p. 58). The urban-rural dimension may be a proxy 
indicator for the degree of participation in such institutions. 
Therefore, villgers dre hypothesized to score lower in moral 
reasoning development than town dwellers. 

Three studies (Thorlindsson, 1978; Park & Johnson, 1983; 
Ismail, 1976) directly compare the difference in DIT scores 
between urban and rural subjects. Only Korean subjects in Park & 
Johnson's study show significant difference between the two 
groups in the P- and Stage 3- score. Urban subjects score high 
in P score, but *rural subjects score higher in Stage 3. The 
difference was attributed to lack of cultural relativism and 
greater shame orientation of rural subjects. 

In Thorlinsson 's Icelandic study <1978), significant 
difference was found only in stage 4 score between male and 
female subjects of the rural sample. The researcher interprets 
it as indicating that "the recent change in women's role has not 
spread to the. village community". The female's high stage 4 
score is in contrast to t^e claim that stage 4 is more likely to 
be a male stage <Holstein. 1976). 

The result of the three studies can not be counted equally 
in weight because the samples used in the studies are quite 
different. In Thorlinsson' s study, the subjects are 14-15 years 
old students living in a village or city at the time of study. 
In Ismail's study (1976), the subjects were 20-29 year eld 
college students who had come to the United States for advanced 
study . Park & Johnson's subjects include students ranging from 
6 graders to college st ients living in a village or in a city 
at the time of study. Therefore as far as the samples are 
concerned, Park and Johnson's finding could be given more 
credit. 



I H. Delinquent behavior 
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The relation of woral Judgment to behavior is complicated 
and mediated by many factors. According to Rest's ^ur Component 
Model *1983), noral judgment is part of Component II : 
"therefore moral judgment* is one player in a large cast of 
players, and even if it is a star, it is not the whole show" 
(p. 600). As Blasi(1980) concludes, nevertheless, the 
significant, but not very strong relationship can hardly be 
denied. 

Delinquent behavior has been* found to have a significant 
relationship with measures of social cognitive development such 
afc Kohlbergs test, role-taking skills, and Piagefs Golden Rule 
task (Rest, 1983). In 'DIT studies, delinquents (male prison 
inmates) were found to score lower in the P-score than the 
general adult group (see Rest, 1979a, p. 188). In case of 
adolescent subjects (16 year olds), HcColgan( 1975) found that 
they were also significantly lower in P-score08 . 8)» than the 
normal group( 28.7) . 

Two of the 20 studies investigated delinquent adolescents 
in Br zil or South Africa. Some of the DIT scores were found to 
be related significantly to delinquency variables. It is 
interesting that Stage 4 score seems to have more consistent 
relationship with delinquency variables more than the P-score. 

Bzuneck (1978) assessed the discriminating power of the DIT 
between delinquent and non-delinquent adolescents from either 
intact or disrupted families. The average P-score of the four 
gjoups were not found significantly different. However, Stage. 2, 
Stage 4 and the A index showed significant dif fe nee among the 
tour groups. Delinquents from a disrupted (father absent) family 
scored higher on stage 2 than the other three groups. Eowever, 
this group was lower in the Stage 4 score than other groups. On 
theA-iidex , delinquent groups were higher in thp P-score than 
non-delinquent groups (p<.05). 

The average P-score of the Brazilian sample was quite low. 

The non-delinquent's P~score(18.7%) was lower than that of . 
delinquent 's(20. 1%) . Although the difference was not 

statistically significant, it needu^close examination. Ilaybe, 
the low P-score of non-delinquent group was due to their sample 
characteristics, for instance. In other words, the 
non-delinquents might not well represent the non-delinquent 
population of Brazilian adolescents. A good evidence is the 
forced match of age and education level of the non-delinquent 
group. Another reason can be sought in the low reading level 
of the both groupsCthe author assumed their subjects' education 
level was grade 4 to 5). 

Heyns et al.(1981) studied the relation of moral reasoning 
level to behavioral dimensions of juvenile delinquency. The 
subjects were 57 delinquent boys in a reformatory catering for 
severe cases. Their average P score was 21.06(%). The 
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researchers related GIT scores to four dimensions of delinquency 
which were facto/ed out from the Quay measures and subject's 
biographic data. P-score were found to correlate negatively 
with the sucial inadequacy dimension. Stage 4 was significant / 
associated with social orientation dimension. Such results were 
regarded as validating information for both the dimension of 
delinquency and the DIT. The non-significant correlation between 
anti-social egocentric • dimension and the P-score was, 
however, unexpected ontradictory to the previous findings 

such as reported by Pordor (1973). 

I. Familial/social factors • 

^Many studies have the kinds of faailial and societal 
factors related to moral development (c.f. # Edwards, 1981;Rest, 
1979). Although Piaget emphasized the influence of peers on 
chilren's p»oral development more than that of parents, Kohlberg 
acknowledges both. Peers or parents are important because they 
are the main source of role-taking opportunities. This 0 
opportunity varies with family and soc ~ty. Therefore, "Kohlberg 
jfostulates that differences in role-taking opportuni t ier aong 
cultures and socio-economic class can speed up or slow down 
development through the stages. Lower socio-economic classes in 
several cc^mntries seem to go through the stages slower than high 
SES groups (Kohlberg, 1969; Rest, 1983, p. 598)." 

Dickintfon(1979) investigated the relation of moral 
development to sample characteristics of family factors in the 
Australian setting. Eleven sample characteristics (father's 
occupation, parerVs education, country of origin, number of 
children, types of school ajttended, religious denomination, 
etc.) account for on\y 5* of the total P-score variance (p. 128). 
Best predictors were father's occupation, religion, sex, and 
father's education. The correlation of P-score among family 
members were significantly high( adjusted R-square^= . 79) . 

Tsaing(1980) explored the influences of child rearing 
pattern, family sizd, birth order, and SES on P-score. All but 
child- rearing pattern showed non-significant differences. Four 
types of child-rearing pattern were analyzed in relation to sex 
and four age levels! A ^significant correlation was found only 
at "high love-high induction" at age 14 male subjects ( r= 25, 
p<.05). 

The interaction among family mercbers was investigated in 
ralation to DIT scores. According to Dickinson (1979), Style 
A-type verbal utterance (rational and pers*isive) has 
significant correlation with the P-score over Style B-type 
verbal utterance ( imperative and authoritative). In addition, 
Thorlindsson (1979) studied the interaction of mother and child 
in relation to the P-score. Mother-child person oriented 
interaction (reasonable and reciprocal) has big iticant positive 
correlation with P-score over mother child position oriented 
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interaction pouer and authority). By rnd la^ge, the findings 
are consistent yith general expectation as Eduards( 198D) 
indicated: "the 'most successful parent are expected to be those 
verbally and overtly rational who encourage uarm and close 
relation uith children and who promote a democratic style of 
lif-e (p. 3D". 

J. An emic DIT 

Only one out of the 20 studies can be regarded as using an 
emic DIT in the sense that it Has not just a translated version 
of the original DIT for cross-cultural compardson but a full » 
adaptation of the DIT into the target culture uith no direct v 
intention of cross-cultural comparison. Villanueva (1982) 
constructed a measure of, moral judgment (The Exercise in 
Evaluating Issues, hence EEI), that is patterned after the DIT 
but has ioral dilemmas familiar to Filipinos and incorporates 
values cumnQnly accepted by them. Therefore, the EEI can not be 
considered just as a culti^ally adapted* and/or a translated 
version of the original DIT. It is interesting to see how 
similar or different the tuo measures are. The sample comprised 
5DD students from diverse educational levels ranging from 4th 
year high school students to graduate and seminarian students. 
The EEI scores corre 1 ite highly uith the scores of the original 
DIT, uith correlation coefficients ranging from . 63"^tti .91 
(average r=.8347, p<.Dl). The seminarians obtained the highest 
E-score ( EEI's equivalent of P-score in the DIT) among all the 
groups. Houev4r, though the average E-score increases as 
educational level goes up, the differences in mean E-score among 
the high school students, college students, and graduate 
students were not significant. Only the seminarian group 
differed significantly from the other groups. 

The reliability of EEI turned out quite high (r=.74-.91 in 
test-retest measure). The most significant single factor un i^h 
influences the EEI sco.s proves to be education (r=.4166). The 
next factor uas father's education (r=.1296). Uhen considered 
jointly, age and urban residence were the most significant pair. 
Though age did not appear as a significant single factor, it 
appeared significant in interacting uith other variables. Based 
on these findings, the researcher concludes that the EEI has 
very similar properties to that of the DIT, even though the 
average principled morality level uas quite lou compared to the 
American's P-score. It is soraeuhat surprising that, only 13% of 
seminarians and only 21 of graduate students are predominantly 
using prinqAied moral reasoning. Although, all in al-1 , the EEI 
reveals its^lose similarity to the original DIT, the lower 
discriminating pcwer among various educational-level groups may 
be a serious drauback in detecting and tracing the 
^developmental trends of Filipinos' moral judgment development. 
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III. Conclusion and perspectives 

There are tw0 najor kinds of questions of interest in 
cross-cultural studies. The first question concerns whether 
psychological theories developed in one culture generalize tu 
other cultures <e.g., is Kohlberg's stage theory universal? ). 
The second interest is in using cross-cultural variation to 
explore and illuminate relationships among psychological 
variables <e.g., do different social roles of women in different 
cultures effect their moral judgment development? ). 

In terms of the first question ol interest, cross-cultural 
studies are supposed to be "proving grounds". Because the main 
theme of the study is to see if the theoretical assumptions and 
its methodology work or not in another culture, no specific 
hypothesis related to the target culture is required. Just as 
in replication studies, the findings of these studies are 
compared and evaluated in terms of its similarity or difference 
to the findings of the original studies. 

In terms of the second questirfb of interest, however, 
cross-cultural studies have to adopt any Specific hypotheses 
concerning the relation ol some cultural factors or aspects to 
moral reasoning structure or level. In other words, because 
the main theme of the v studies here is to see what is the effect 
of the culture on moral reasoning development, specific testable 
hypothesis are required. In this case, therefore, the 
cross-cultural studies are no longer "proving grounds". The 
similarity- or difference across cultures are not just compared 
but explained in terms of what cultural factors makes the 
difference or similarity. 

Applying these two questions to th studies reviewed here 
as criteria for evaluation, I regard the findings as moderately 
satisfying the first criterion but not the second one. Based on 
the findings above, it may be possible to conclude that all in 
all the cross-cultural findings are comparable to American 
findings and Kohlbergian findings. The similarity of the 
findings among them are a good demonstration that the DIT has 
cross-cultural validity in detecting moral reasoning structure 
and its development in other cultures outsids the USA. 

However, considering that the sttfdies reviewed here are small 
in number<20>, not exhaustive in the cultures covered<15>, and 
by no means perfect in research design and analysis, the 
conclusions are iiot strong . The studies reviewed here can not be 
considered as having fully carried out the role as a " proving 
ground ". Pirst, they did not cover so broad a spectrum of the 
age-education sample as in the original DIT studies. Second, 
they d-id not lully explore the relations of convergent and 
divergent variables to the measures of the DIT. Third, most of 
the studies did not show the psychometric properties of the 
translated DIT to be comparable to those of the original^ DIT. 

I 
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For instance, if the DIT items had been biased against some 
cultural groups over the other cultural groups, then the direct 
comparisons of the DIT scores (e.tj. , P-score) night have 
resulted in errorneous conclusions. Fourth, most importantly 
there was no lonqitudinal evidence. 

One of important t}oais of cross-cultural study lies in the 
explanation of cultural effects on moral reasoning structure and 
its development. Most of the studies revieued here have not 
given much attention to this point. Feu tried to explain the- 
similarity or difference of their findings to that of American 
studies in terms of cul tmre i " Variation. Considering that most of 
the studies are the fi rst - attempt of using the DIT in their oun 
culture, it is no wonder that they had to be more involved in 
the issues of the first criterion. 

I |fegard that the validation and explanation roles of the 
DIT in ^cross-cultural studies should be pursued together. To 
strengthen the explanation role, it is important to formulate 
specific, testable, and meaningful hypothesis concerning the 
relation of the cultural aspects to moral reasoning structure 
and its development. In formulating such hypotheses, it may be 
necessary to have interdisciplinary perspectives including those 
of anthropology, ecology, history, and sociology. 
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r- .69 

- 


ni 


10. Mexico 

Mil ler( 1979) 

Pii 1 1CI \ I 7 1 7 t 


Mexican & 
US atudentai 




















Bilingual 

Mexican 

etudente 


37 


11 - 

l^ th 


19.6 




sex, 
age. 
career 


lone. 
English 


ni 


ni 




US atudents 
living in 
Mexico 


05 


n - 

12 th 


22.5 


- 


interests 








11. Philippines 




















V 1 1 1 anuev a 

emlc'i 

ci oun- 
tjeu ti ortdl 


& fcuu on is 

113 students 

MS students 

Collage 
etuiJonta 

H raduate 
a tudents 

Seminarian 


77 
70 

U2 

16 

23 


i3.e 

15.3 
20. H 

32.9 
26.0 


21.3 
2*. 8 
22.3 

23-5 
31.6 


7*5 
9.6 
7-5 

7.2 
8.9 


age. educ 

sex, 

urban- 
rural 

familial 
lac tor 


6-story >f orm 

ni, 

nl 

/ 


test-retest 

r« . 7 1 * v * . 9 i 

DIT A EEI 
r-.8j 


ontio 


12. Saudi Arabia 




















I Hmall( 1976) 

6 tlC , 


|((S t, Qa lift t 

utddenta In 
US college 


















cr^aa- 
BbCtlonal 


American 

Undergraduatt 
Graduate 

Saud 1 -A rah 1 an 


20 
20 


28 

20 


22.0 
29.5 


7.5 


culture, 

length of 
stay in 
USA, 


lone. 
English, 


nl 


nl 




Undergradua tt. 
Gra lua te 


21 
19 


28 
28 


10.5 


**.7 
6.2 


rural 








1*), South Afrloa 




















Ho j na e t a* 
(19R1) 

etif , 
r roytJ- 
nc* tl onal 


Delinquents 
(toys) 


57 


11- 

12 th 


21.1 


12.7 


demo- 
graphic 
data & u 
Quay Scale 


lone, 
Encl i ah , 


ni 


ni 


















Taiwan 




















Tu^ln ( -;(1980) 
iim Llunul 


Studenta i 
JR students 
,lr atudents 
SH atudettts 
SR students 


160 

16*> 

150 
172 


13 
1H 
16 
17 


20 Jl 

2f>. 8 


6.8 
6.2 


age, educ 
wax , SES, 
Kumlly u\t 

birth orde 
thlM- 
reari ng 


long, 

Chinese, 

Group 
diocuaalon 


I - 3 r > K 

y.* > 

• ' ' ' 


ni 



BEST COPY 

er|c 23 



I* 



,7 



Table 1 



"iomnnr 



'nl ture(country) . 



Sample 
dv&Lt lptlon 



Gendron 

(1981) 

otic, 

ctobb- 

wuitlonel 



15. Trinidad * 
Tobago 



Buddoe{l98l) 

otic, 
crosa- 
aec tional 



a t'.ol ic 
b t jJcuto I 
i»roup 1 
Group 2 
..roup 3 
i '.roup ^ 
Iroup 5 



k teacher's 

college 

students 

u- 

College I 

College*" II 
Collugs^III 
College'* IV 



210 



Urade 



16-18 
l6-l8 
20-26 
23-63 
22-30 



20 - 



Scure 


S.D 


Major 
variables 


Forms of DIT. 

Language, 

Translation 


Valldl ty/ 

RolUbllliy 

■• - 


Cul tural 
jul.iji t.i 1 1 on 




31.7 


11.5 


age. educ, 


short, 


nl 


M-ltems 


28.9 


10.7 

10.7 


religion 


Chinese* 
nl 






33. 


1*».6 












15.8 










35.1 




age, educ 


short 9 


nl 


nl 


25.0 
o 26.2 
9 26.7 

A 




religion 

achool 
type 


nl, 
nl 







Note i 



longi 6 dilemma story DIT 
short! 3 di lemma story DIT 



2. 



3. 



PR 
JR 

SR 
IIS 

nl 



i Primary school 
•Junior High School 
•Senior High School 
•High School 

•No information or information not available 

Anglo-Australian students. Students whose parsnts were born in Australia or who 

migrated from Britain. 
Greek-Australian students* Sludents parent* were of Creek descendents snd at 

least ona parent fcrew up in Greece arid came to 
Australia, after elghtoen years of age. 
Miscellaneous Australian i Students whose at least one or parents waa of non- 
English speaking background and did not meet the 
' \ criteria for inclusion in one of the Ai.glo- and 

' Greek-Australian group. 

Asian- Guineas • Students whose both or the parents were of Chinese 

distent, 

5, Only the synopsis or abstract or lbs study waa uued in this review. 



Bell's Personality Adjustment Inventory 

School I • The Sackler School of Medicine. T ] Aviv Univeroi ty( 3TA ) , where 



School TI » 



applicants are selected only on the basio of previous ocholuotlca 
achievement and performance on a psychometric teat. 
The faculty of Health Sciences. Hen-Uurlon Univeroi tyt BGU) .where 
applicants are selected on the bnnis of personal character! a tlca . 



Interpersonal skills srid orien to ti on toward the community ao cxpereaai 
hy previous behavior with down-played u» hoi autic achievement. 

ft. U3 teachersiAmerican teachers employed by a Department of Defense Overseas school 
located In Japan, a Ml»i<ile School. ... w 

US motheis .American-born wo«ten,who are wivta or American enlisted men or ofllcen 

or Department of Defense civilians serving in Japan. 
Japanese-born mothers* Jupanes-American women who are wives or American soldiers 
or D0I) civilians serving in Japan. 
9. emlc» A term coined by Pike<1966) to refer to an anthropological approach 
In which cross-cultural comparison is ruled out. 

10. LEI • The Exercise In Evaluating Issues 

11. Quay S'ulooi 3 lists and questional re w* re uut:d 

a. The Behavior Problem Checkl 1 ot( Quay A Parsons, 1972 ) -ft list 

b. The Checklist for vae Anulyslu of Llfa History Duta<Quay & Paroona 
1972)-A iiat 

c. The Personal Opinion Study(Quay 1 Parsons. 1972)- T list 



12. College Ti Oath olio Women's Tescher*s Colle^e(CWTC) 
Collets Hi Co; eminent Trulrtf College <fiTC) -Coed 
College Illi P0STC- coed 
rullefov IVi Maualca-f oed 



o 

ERIC 



BEST COPY 

2i 



Table i 

CulturaKethnic background ) effect 

V 



Study 


Sample 


age 


U 


r(#) score 


SD 


Significance 


Watson 
(1933) 


1 Anglo-Australian CS^ 

2 tiisc- Australian CS 

3 Greek-Australian C5 
h Asion Chinese CS 


18.3 
16.6 
18.5 
19.11 


20 
10 
10 
20 


*0.75 
41.C4 

34.00 

32.73 


6.97 
13-55 

9.17 
12.73 


1 & 2 MS 

1 & 3 * 
14 4 * 


Wilier 
(197^) 


American H3 
Mexican H3 


17 
17 


35 
31 


37.6 
32.8 


- 


N3 


Fox(l9B2) 


Er.glich H3 


17 


' 33 


33.9 








Greek HS 


17 


18 


320 






Jaccbsori 
(1977) 


1 American-horn motherscj 

2 Japanese-born mothers 

3 Children of Japanese- 
torn mothers MS 

4 Children of American- 
born mothers M3 


- 

10->l4 
10-14 


15 

39 

21 


28.26 
18.51 

2^.57 


14.14 
12.0 
6.90 

8.36 


14 2 • 

3 & 4 • 


WaU9b0j 


English HS 1 
Chinese HS 


15-2 
17-1 


loS 

78 


26. 11(16. o)~ 

27i9( isor 


14.0(4.2) 

12.5(3.2) 


P-scorei N3 
CD-score) 1* 


Ismail(l97to 


US undergraduates 
US graduates 


28 
28 


20 
20 


22.00 
29.15 


7.^5 -> 


- 25.58(8.70). 




C3udi Undergraduates 
Gaud graduates 


28 
28 


21 
19 


15.71 
18.52 


4.7*^ 
6.22-^ 


-l6.95'5.64)J 


Deyoung 


US English teacher 




3° 






ns 


(1930) 


Japanese college 

teacnera 




17 









li See Table 1 for detailed explanation 

* l P <- 05 

tiSiNon/cignif icance 
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Figure 1 

Afire-education trends of ?(%) score 



P<*) 
Score 



is 

to 
3 o 



Iceland^? 



Sib 



U.S. A 



Australian ^ 




Grade 



Id 14- 
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:'.i.th Jcnooi 



ir 
1 



16 
le 



n 
1 1 



if 



oenior 
Hign 3cnool 



Graduate- Adul 
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Moral development < Yong Lin Moon) 



A Review of Cross-cultural Studies on Moral Judgment 
Develcpment Using the Defining Issues Test 

n 

Yong Lir^ Moon } 
University of Minnesota 



< Abstract ) 



Twenty cross-cultural studies on moral judgment development 
using the Defining Issuer Test <Df^> were reviewed with respect to 
cross-cultural validity, age/educa££on trends, gender difference, 
correlations with other psychological factors, religion, 
urban-rural, delinquency, and familial/societal factors. By and 
large, results indicate that th£ DIT has similar psychometric 
properties (factor st ructmT^r^internal consistency, and reliability) 
and construct validity in non American cultures. However, the rate 
of development and the strength of relationships with other 
variables were found to differ across cultures, especially between 
Western and non-U^stern cultures. It is suggested that fine grained 
research is needed to explain (instead of describing) the 
differences and similarities in terms of cultural causation. 
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